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Subunit organization of the abalone Ha/zof/s tuberculata hemocyanin 
type 2 (HtH2), and the cDNA sequence encoding its functional units 
d f e # f, g and h 
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We have developed a HPLC procedure to isolate the two different hemocyanin types (HtHl and HtH2) of the 
European abalone HaUotis tuberculata. On the basis of limited proteolytic cleavage, two-dimensional Immuno- 
electrophoresis, PAGE, N-terminal protein sequencing and cDNA sequencing, we have identified eight different 
40-60-kDa functional units (FUs) in HtH2. termed HtH2-a to HtH2-h. and determined their linear arrangement 
within the elongated 400-kDa subunit. From a HaUotis cDNA library, we have isolated and sequenced a cDNA 
clone which encodes the five C-terminal FUs d, e, f, g and h of HtHl As shown by rnuidple sequence alignments, 
defg of HtH2 correspond structurally to defg from Octopus dofleini hemocyanin. HtH2-e is the first FU of a 
gastropod hemocyanin to be sequenced. The new HaUotis hemocyanin sequences are compared to their 
counterparts in Octopus, Helix pomatia and HtHl (from the latter, the sequences of FU-f. FU-g and FU-h have 
recently been determined) and discussed in relation to the recent 2.3 A X-ray structure of FU-g from Octopus 
hemocyanin and the 15 A three-dimensional reconstruction of the Megatkura crenulata hemocyanin didecamer 
from electron micrographs. This data allows, for the first time, an insight into the evolution of the two 
functionally different hemocyanin isoforms found in marine gastropods. It appears that they evolved several 
hundred million years ago within the Prosobranchia, after separation of the latter from the branch leading to the 
Pulmonata. Moreover, as a structural explanation for the inefficiency of the type 1 hemocyanin to form multi- 
decaraers in vivo t the additional N-glycosylation sites in HtHl compared to HtH2 are discussed. 
Keywords: evolution; Gastropoda; hemocyanin; KLH; Mollusca; primary structure. 



In molluscs, the blue, copper-containing respiratory protein 
hemocyanin has a molecular mass of 4 MDa and multiples 
thereof. These large molecules occur freely dissolved in the 
hemolymph of many gastropods and also of chitons, cephalo- 
pods and some bivalves. The basic 4-MDa unit is a ring-like 
decamer consisting in a wall of 35 nm in diameter and 18 nm in 
height, with an internal collar. In cephalopods, native hemo- 
cyanin is present exclusively as symmetrical decamers with a 
central collar, whereas cgiton hemocyanins are asymmetrical 
decamers with a peripheral collar. In bivalves and gastropods, 
two asymmetrical decamejs are assembled face-to-face with a 
collar complex at each end. In prosobranch and opisthobranch 
gastropods, and also in a bivalve, didecamers are often accom- 
panied by tube-like mult(decamers of varying length which 
consist in a didecamer with additional decamers assembled at 
one or both sides [1,2). 

The 400-kDa subunit of mollusc hemocyanins is a 
polypeptide folded like a pearl chain into seven or eight 
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globular substructures, the so-called 'functional domains' or 
'functional units' (FUs). termed FU-a to FU-h. All FUs possess 
a molecular mass of 50 kDa and each carries a binuclear 
copper active site, but they differ considerably in primary 
structure and immunological properties [1]. From Octopus 
dofleini hemocyanin, the complete amino acid sequence of the 
seven-FU subunit is available [3]. Moreover, the X-ray structure 
of an Octopus hemocyanin FU dimer has recently been solved 
at 2.3 A resolution [4]. From the more complex gastropod 
hemocyanins, much data on disassembly, reassembly, and 
oxygen binding behaviour is available [1] and a 15- A recon- 
struction of the didecamer has recently been derived from 
electron micrographs [5], but the sequence data are rather 
fragmentary. In recent years we and others have studied the 
hemocyanins intensively from two closely related marine 
gastropods, the keyhole limpet Megathura crenulata [6-13] 
and the abalone HaUotis tuberculata [14]. In both animals the 
hemocyanin occurs as two isoforms (KLHl and KLH2 in 
Megathura, HtHl and HtH2 in Haliotis)\ KLHl /HtHl and 
KLH2/HtH2 are interspecifically related. Keyhole limpet 
hemocyanin (KLH) is a widely used immune stimulator 
[9,11,12]. 

From hemocyanin-producing pore cells in the mantle con- 
nective tissue of HaUotis [15] we have purified rnRNA and 
constructed a cDNA library. By antibody screening, we recently 
isolated a cDNA clone encoding FU-f to FU-h of HtHl; 
moreover, the subunit organization of HtHl has been presented 
[14]. Here we report complementary experiments with HtH2 
which go a considerable step beyond the information available 
for HtHl : for HtH2. in addition to the three C-terminal FUs, the 
sequence of FU-d and FU-e has also been determined. 
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MATERIALS AND METHODS 

Purification of HtH2 

Maintainance of animals, hemolymph collection, and isolation 
of total hemocyanin was essentially as described previously 
[14]. HtH2, which is the minor hemocyanin component in 
H. tuberculata hemolymph, was purified from HtHI by anion 
exchange chromatography using a Mono-Q column on a HPLC 
apparatus (Applied Biosystems); elution was performed using a 
NaCl step gradient. Re~chromatography was required to obtain 
an electrophoreticaily almost pure HtH2 fraction. Dissociation 
of HtH2 oligomers into subiinits was achieved by overnight 
dialysis against 0.5 m glycine/NaOH buffer (pH 9.6) at 4 °C. 

Electron microscopy 

Conventional negative staining was performed by the single 
droplet procedure [16] as described elsewhere [14]. A Zeiss EM 
900 transmission electron microscope was used for this study. 

PAGE and Immunoelectrophoresis 

SDS/PAGE was performed according to Laemmli f!7J. For 
native PAGE, an alkaline system according to Markl et al [18] 
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was used, with 0.33 m Tris/borate ( P H 9.6) gel bufTer, and 
0.065 m Tris/boraie (pH 9.6) electrode buffer. Crossed, and 
crossed-line immunoelectrophoresis were performed as described 
previously [19,20]. Rabbit antibodies were raised against dis- 
sociated total HtH (i.e. HtHI + HtH2) and purified HtH2 by 
Charles River Deutschland (Kisslegg, Germany). The immun- 
ization procedure was as described by Markl and Winter [21]. 

Limited proteolysis and isolation of fragments 

Limited proteolysis was performed at 37 °C in 0.13 M glycine/ 
NaOH pH 9.6. by adding one of the following enzymes 
(Sigma) dissolved in 0.1 m NH 4 HC0 3 (pH 8.0): Staphylococ- 
cus aureus V8 protease type XVII (8400), bovine pancreatic 
elastase type IV (E-0258), bovine pancreatic a-chymotrypsin 
(C-3142), bovine pancreatic trypsin type XIII (T-8642). The 
hemocyanin concentration was between 1 and 10mg-mL~\ 
and the concentration of the enzyme was 2% (w/w). Proteolysis 
was terminated after 5 h by freezing at -20 °C. HPLC was 
performed on an apparatus from Applied Biosystems equipped 
with a model 1000S diode array detector. Proteolytic fragments 
were applied to a Mono-Q anion exchange column (Pharmacia) 
equilibrated with 0.02 M Tris/HCl (pH 8.0) and eluted by a 
linear NaCl gradient (0-0.5 M) in the same buffer, at a flow rate 




Hg. 1. Purification and characterization of 
HtH2. (A) Partial separation of HtHI and HtH2 
from total HtH (resuspended uttracentrifugation 
pellet) by anion exchange HPLC on a Mono-Q 
column, using a NaCl step gradient for elution. 
The first elution peak contained pure HtHI; the 
second peak contained mostly HtHl which was 
stil! contaminated with HtHI. By te-chromato- 
graphy of the second peak (data not shown), the 
remaining HtHI could almost quantitatively be 
removed. (B) Electron microscopy of negatively 
stained HtH2. showing mostly didecamers but 
also short multidecamers (i.e. uidecimers); 
bar = 100 nm (C-E) Immunoelectrophoresis 
using rabbit anti HtH antibodies. Prior to these 
experiments, HtHI and HtH2 were Associated 
into summits. In the first dimension, the anode 
was on the left. (C) Crossed immunoelectro- 
phoresis of total HtH as applied to the. experiment 
shown in (A); ami-HtHl + HtH2 antibodies from 
a rabbit were used. In the first dimension, the 
anode was on the left. (D) Crossed Immuno- 
electrophoresis of HtH2 obtained after 
re-chromatography; the same antibodies as in (C) 
were used. (E) Crossed-Hne Immunoelectro- 
phoresis of total HtH, with purified HtH2 in the 
line to demonstrate which precipitation peak has 
been obtained; the same antibodies as In (Q were 
used. (F) SDS/PAGE (7.5% polyaciytemide) of 
HiH2 proteolytic fragments; the anode was at the 
bottom. Lanes: 1, molecular weight markers 
(masses indicated); 2, total cleavage products of 
subunit HtH2 with elastase; 3, total cleavage 
products of subunit HtH2 with V8 protease; 6, 
total cleavage products of summit HtR2 with ' 
trypsin; 8, total cleavage products of subunit 
HtH2 with ehymotrypsin; 4, 5, 7, 9-12, various 
fragments enriched by HPLC (Fig. 2) and 
identified as described in the text and shown in 
Hg. 3. 
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of 1 mLmirT 1 . Alternatively, proteolytic fragments were 
isolated by excising bands from native PAGE gels [18] after 
inverse staining with the Roti- White system (Roth, Karlsruhe, 
Germany) as descibed by Fernandez-Patron et al. [22]. For 
subsequent cleavage with a second enzyme, isolated fragments 
were first dialysed overnight against 0.13 m glycine/NaOH, 
pH 9.6 to remove the NaCl. 

Amino acid sequence analysis 

Proteins from HPLC were denatured in SDS -containing sample 
buffer and separated by SDS/PAGE ([17]; 7.5% polyacryl- 
amide). To avoid N- terminal blockage, 0.6% (v/v) thioglycol 
acid was added to the cathodic buffer [23]. Protein bands were 
electro-transferred to ProBlot membranes (Applied Biosystems) 
[12]. Detection of individual polypeptides on the membranes 
was performed by Ponceau S staining. Polypeptide bands of 
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Fig. X Separation of proteolytic cleavage products of subunit HtH2 by 
HPLC. The products of limited cleavage with V8 protease (A), chyrno- 
trypsin (B), and trypsin (Q were applied to a Mono-Q column and el u ted 
by a iinear NaCl gradient Components in the fractions were identified by 
SDS/PAGE (Fig. IF) and crosses-line Immunoelectrophoresis (Fig. 3) as 
described in the text. Those elution peaks fiom which a component was 
used in the final analysis are indicated by labelled arrows. Labels without a 
bracket indicate that the component was sufficiently pure to be analysed 
directly. Labels in brackets indicate that further purification by native PAGE 
was required. 



interest were cut out and sequenced in a 477A protein 
sequencer apparatus from Applied Biosystems by H. Heid 
(DKFZ, Heidelberg, Germany). The quantity of polypeptide 
applied to the sequencer was in the lower pmol range. 

cDNA cloning and sequence analysis 

A lambda cDNA expression library constructed from poly(A*) 
RNA of Haliatis mantle tissue using vector lambda ZAP 
Express™ according to the instructions of the supplier (Strata- 
gene) was available in our laboratory [14]. Clones were isolated 
using a digoxigen in -label led cDNA probe of 580 bp length 
corresponding to the C-terminal region of HtHl-g and the 
N-terminal region of HtHl-h (for sequence analysis of cDNA 
coding for HtHl, see [14]). cDNA sequencing was performed 
by commercial services on both strands using the Taq Dye 
deoxy Terminator system. Sequence alignment was performed 
with the clustal w (1.7) and treeview software packages 
[24,25]. 



RESULTS 

Isolation of HtH2, and analysis of its subunit organization 

Separation of HtHl and HtH2 by anion exchange chromato- 
graphy is shown in Fig. 1 A. If the HtH2 fractions still contained 
some HtHl, this impurity was removed by re-chomatography. 
When examined by electron microscopy, purified HtH2 appeared 
mostly as didecamers, but the occasional tridecamer was also 
present (Fig. IB). This contrasts with HtHl which consists 
exclusively of didecamers [14]. Purification of the HtH2 sub- 
unit was monitored by dissociation of each chromatographic 
fraction, at alkaline pH, into subunits followed by crossed, and 
crossed-line Immunoelectrophoresis using anti-HtHl + HtH2 
antibodies raised in a rabbit (Fig. 1C-E). Compared to the 
starting material shown in Fig. 1C, after the chromatographic 
purification procedure only a trace of HtHl was left as a 
contaminant (Fig. ID). Crossed-line immunoelectrophoresis of 
total HtH subunits in the starting hole with the chromato- 
graphicaliy isolated subunit material (Fig. ID) in the line 

Table 1. N-terminal sequence of isolated FU fragments of HtH and 
KLH. For more data on KLH1, KLH2 and HtHl, see [9,12,14]. The 
respective position in the solved primary structure of HtH2-defgfr is 
indicated by position numbers (for HtH2-d and HtH2-e, see Fig. 4; for 
HtH2-g. see Fig. 5; for HtH2-h, see Fig. 6). Note that in HtH2-e, cleavage 
by chymotrypsin of an Arg-Pro bond is not very probable; according to 
Fig. 4, one would expect the sequence *LRPD. However, for unknown 
reasons, the enzyme also removed 4 LR 5 from HtH2-e. In the case of 
HtH2-g, uyplic cleavage between FU-f and FU-g is more likely at the 
4l *Arg-Ala bond; however, the expected N-termina! sequence 4,7 AAD- 
SAHS 422 (Fig. 5) was apparently removed from HiH2-g by additional 
cleavage at ^Scr-AH. 
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Fig. 3. Analysis of the subunit organization 
of HtH2. For iironunoelectrophoresis, 
anti-HtH2 antibodies raised in a rabbit were 
used throughout; in the first dimension, the 
anode was on the left. (A) Crossed Immuno- 
electrophoresis showing a panel of eight 
different FUs as obtained by limited elastase 
proteolysis of purified subunit HtHJ. 
Identification of the various peaks as HtH2-a to 
HtH2-h was the net result of a large number of 
experiments as described in the text and below. 
(B-H) Crossed line Immunoelectrophoresis of 
elastase -cleaved subunit HtH2 as shown in (A), 
with HPLC-isolatcd proteolytic fragments of 
subunit HtH2 in the line to identify'to which 
precipitation peak(s) they correspond (for 
HPLC profiles, see Fig. 2; for SDS/PAGE of 
fragments, Fig. IF). Identification of the 
fragments was achieved by N-terminal 
sequencing (Table 1). 
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demonstrated that the bulk profein was indeed HtH2 (Fig. IE). 
N-terminal sequencing of the HtH2 subunit revealed a single 
primary structure which confirmed that this material was 
sufficiently homogeneous for further analysis; the sequence 
obtained showed similarity to FU-a from HtHl, KLHi and 
notably KLH2 (Table 1). 

The seven or eight functional units (FUs) constituting a 
molluscan hemocyanin subunit usually lack immunological 
cross-reactivity as deduced from crossed immunoelectrophor- 
esis, and this observation is in accord with the considerable 
difference in primary structures [9,12,26,27]. With isolated 
HtH2 subunit (Fig. ID), we used low concentrations of four 
different proteases (elastase, V8 protease, trypsin, and chymo- 
trypsin), which in HtHl, KLHI and KLH2 had cleaved the 



exposed polypeptide chain between neighbouring FUs, without 
destroying peptide bonds within the FUs [9,12,14]. The cleav- 
age products were studied by SDS/PAGE (Fig. IF). Elastase 
produced mostly fragments migrating on SDS/PAGE close to 
50 kDa, which corresponds to single FUs (Fig. IF, lane 2). The 
other three enzymes also yielded some 50-kDa products, but 
in addition produced FU diroers, trimers and larger fragments 
of the HtH2 subunit (Fig. IF, lanes 3, 6 and 8). 

HPLC (Mono-Q) separation of the V8 protease cleavage 
products of subunit HtH2 yielded two fused elution peaks 
(Fig. 2A). In a fraction at the leading edge of the second peak, a 
fragment of 150 kDa was highly enriched (Fig. IF, lane 4). 
This component was purified by blotting* sequenced and 
exhibited the same N-terminal sequence as the intact subunit 
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RtH2-d gAra YDEWTXASH nUO^Pr^GHVESLRSfl.f^LaHD GVYEMIAKFB3KPCLCD--DJJGRKVACCV7TGM 70 

OdH-d BACIWEPVTSAK RI1U01LK&LTDGPMKSLRKAFKPMTTD- -ORYEBI AflFBGLPAQCPMKDGSKVYTCC IBOM 17 

HtH2-e SSPLRTOGHS^WLVJWEVN^rra^ 75 

OdH-e SADTK3E-BOHX YLVRM3\raRL^SEMNSI>IHAFRR>K)RD^ 74 



Fig. 4. Multiple sequence alignment of FU-d 
and FU-e from HtH2 and Octopus baenso- 
cyanln. The linker regions between neighboured 
FUs are underlined. On the top of each sequence 
block the secondary structure elements of OdH-g 
according to Miller et ah [3] are indicated; for 
their position in the X-ray structure 14], see 
Fig. 7. Strictly conserved residues are marked by 
an asterisk (identical); isofunctional exchanges 
are marked by either a colon (very similar) or by 
a point (similar). Copper ligand hjsudtnes am 
marked by reversed asterisks. Potential disulfide 
bridges and N-glycosylation sites are indicated 
by black boxes. The alignment was performed 
with the clustal software. The OdH sequences 
are taken from Miller el ah [3]. 
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HtH2-/ NTRSKSP-TOVRRFI^Pl^XRDI>5SLKSAIJU)L0EDDG PNCYQMAAFBCL?AQCHDSHGMgIACCIHGKPTrP& 74 
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Fig. S. Multiple sequence alignment of FU-fg 
from HtH2, HtHl and Octopus haemocyanin. 
Codes are the same as those used in Fig. 4. The 
HtHl sequences are from Keller et al. [14]; the 
OdH sequences are from Miller et al. f3J- 



(Table 1) and therefore represented HtH2-abc. Crossed 
immunoelectrophoresis of the elastase cleavage products of 
subunit HtH2 showed eight discrete immuno-precipitation 
peaks (Fig. 3A), which on correlation with the results 
from SDS/PAGE (Fig. 1 F, lane 2) represent the individual FUs. 
In crossed -line immunoelectrophoresis, HtH2-abc fused com- 
pletely with three of the eight immunoprecipttation peaks of the 
elastase cleavage pattern (Fig. 3B). 

HPLC separation after chymotryptic cleavage of the intact 
subunit yielded several fused elution peaks (Fig. 2B). The last 
peak almost exclusively contained a fragment of ^ 100 kDa 
(Fig. IF, lane 9), which showed the same N-terminal sequence 
as HtH2-abc (Tajble 1) and therefore represented HtH2-ab. 
Crossed-line Immunoelectrophoresis of this HPLC fraction 
revealed a single protein, which completely fused with two FUs 
from the elastase cleavage pattern (Fig. 3C). Both are also part 
of HtH2-abc (Fig. 3B), and therefore must represent HtH2-a 
and HtH2-b. This, by deduction, identified the third component 
incorporated in HtH2-abc as FU-c (Fig. 3B). Isolated HtH2-ab 
(Fig. IF, lane 9) was cleaved by elastase, resulting in several 
products; the major protein could be isolated by excising its 
band from a native PAGE gel. In SDS/PAGE, it showed a 



molecular mass of 50 kDa (Fig. IF, lane 10), and its 
N-terminal sequence identified this FU as HtH2-a (Table 1). 
The precipitation peak of HtH2-a was revealed by crossed-line 
immunoelectrophoresis (Fig. 3D); thus, the still remaining 
component of HtH2-abc must be HtH2-b (Fig. 3B). At this 
stage of the analysis, FU-a, FU-b and FU-c were identified in 
the pattern of crossed immunoelectrophoresis (Fig. 3 A). 

A fragment of « 45 kDa found in the first HPLC elution 
peak after chymotryptic cleavage (Fig. 2B) was further 
enriched by excising its band from a native PAGE gel. After 
this procedure, the eluted protein showed a satisfactory purity 
in SDS/PAGE (Fig. IF, lane 11). Similarly, from the second 
HPLC peak of the chymotryptic cleavage products a 55-kDa 
fragment was obtained (Figs 2B and IF, lane 12). Both frag- 
ments were N-terminally sequenced (Table 1), and then identi- 
fied as HtH2-d and HtH2-e by aligning their sequences to the 
primary structure derived from a cDNA coding for HtH2-defgh 
(see below). Subsequently, they were assigned to their respec- 
tive inununoprecipitation peak in the elastase cleavage pattern 
of subunit HlH2 by crossed-line Immunoelectrophoresis (Fig. 
3E,F). As a consequence, five of the eight FU precipitates were 
now identified (Fig. 3A). 



© FEBS 1999 



1 sUUUlu 



Abalone haemocyanin suwrlit structure {Eur. J. Biochcm. 265) 139 



ntm-h 
Htm -/j 



-Bi- 



te- &L 



grrEVTRQHTOGNAH-FKBKrVPSL^LDHAI^^ 74 
GIWED-EHHrXBUJU^TWCEVPPL^gANAI^ 74 



HtH2-A 



CMPV7PHWBKLKTIQMERXUCKHCS PMG IPYWDWTKiaiS SLPSFFGD SSWNNPFYlCmitCVOHOTRfiVNQlU-? 149 



Fig. 6. Sequence alignment of FU-h from 
HtH2 and HtHl. The." linker region between 
FU-g and FU-h is underlined, the start motif of 
the unique C-terminal extension of FU-h is also 
underlined and marked by bold letters. Codes are 
the same as those used in Fig. 4. Note that 
compared to other FUs, the third disulfide bridge 
is missing fn both FU-h versions. Also note 
conservation of two cysteines in the C -terminal 
extension which probably allow formation of an 
edditionai disulfide bridge. C-termimis of 
polypeptide chain. The HtHl sequence is from 
Keller et al {14}. 
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HPLC separation after tryptic cleavage of the intact HtH2 
subunii yielded four fused elution peaks (Fig. 2C). In 
SDS/PAGE, the first peak showed a strong protein band of 

45 IcDa (Fig. IF, lane 7). By N-terminal sequencing and 
alignment to the sequence of HtH2-defgh it was identified 
as HtH2-g (Table 1). This FU fused, in crossed-line 




N 



Fig. 7. Schaematlc view of the X-ray structure of OdH-g, showing 
positions of the secondary structure elements indicated in Figs 4-6. 
Arrow. asparagioe-Iinked carbohydrate side chain (molecular mass 
« 1000 Da). Drawn from Miller et at. [3] and CufT et al f4]. 



Immunoelectrophoresis, with one of the three remaining 
peaks of the elastase cleavage pattern (Fig. 3G)> which -vas 
thereby identified, in the total pattern, as HtH2-g (Fig. 3A). The 
first HPLC peak of the V8 protease cleavage products of 
subunit HtH2 (Fig. 2A) contained almost exclusively a 
»* 60 kDa fragment (Fig. IF, lane 5). It fused, in crossed-line 
immunoelectrophoresis, with one of the two yet unidentified 
FU precipitates of the elastase cleavage partem (Fig. 3H). 
According to its N-terminal sequence in comparison to the 
primary structure of HtH2-defgh, this component was identified 
as HlH2-h (Table I). After this approach, there was just a single 
precipitate left which could not be isolated but in all probability 
represents FU-f; consequently, the entire FU organization of the 
HtH2 subunit was deciphered (Fig. 3A). 

Sequencing of a cDNA clone encoding five functional units of 
HtH2 

A cDNA expression library constructed from H. tuberculata 
mantle tissue was screened using a cDNA probe encoding a 
region of HtHl -gh (see Methods). Cross-hybridization of this 
probe resulted in isolation and sequence analysis of several 
new cDNA clones of varying length which all encoded the 
C-terminal region of the same hemocyanin polypeptide; it 
was clearly different from HtHl. The largest of these cDNA 
clones contained 6726 bp. The cDNA sequence is not shown 
here, but is available in the EMBL data bank (accession number 
A JO 12048). This clone, termed kth2-defgh % contains an open 
reading frame for a polypeptide of 2160 amino acids (Figs 4-6) 
with a calculated molecular mass of 248 kDa. The translated 
region must represent the C-terminal part of the polypeptide 
and not a random fragment, because the open reading frame is 
terminated by a slop codon followed by a untranslated region of 
243 bp with a polyA tail. Alignment with olher moHuscan 
hemocyanin sequences, notably from Octopus hemocyanin [3] 
and HtHl [141 revealed that the encoded polypeptide 
encompass five complete FUs, and comparison to the 
N-terminal sequences shown in Table 1 identified it as part of 
HtH2. It starts in the linker region between HtH2-c and HtH2-d 
and contains the complete amino acid sequence of HtH2-d, 
HtH2-e, HtH2-f, HtH2-g and HtH2-h (Figs 4-6). From our 
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Fig. 8. Matrix of percentage similarity (top) 
and percentage diversity (bottom) In 
compared sequences from molluscan 
haemocyanm FUs. Multiple sequence 
alignment was performed with the clustal 
software. The sequences are derived from 
haemocyanin of the prosobranch gastropods 
H. tuberculoid (HtH) and Rapana thomasiana 
(RtH), the pulmonate gastropod Helix pomatia 
(HpH£). and the two cephalopods O. dofleinl 
(OdH) and Sepia officinalis (SoH). For sources, 
see [3.14,29.30.40]. 



analysis of the FU organization of the HtH2 subunit (see above) 
as well as from the absence of a start codon and sequence 
coding for a signal peptide, we are sure that clone hth2-defgh 
does not encode the entire hernocyanin polypeptide and that the 
three N-terminal FUs are still missing. A multiple sequence 
alignment of HtH2-d f HtH2-e and the corresponding FUs in 
Octopus dofleini hernocyanin [3] is presented in Fig. 4. In 
Fig 5 a multiple alignment of HtH2-fg with HtHl-fg [14] and 
OdH-fg [31 is shown. In Fig. 6 we compare the sequences of 
HtH2-h and HtHl-h [14]. For clarification, Fig. 1 shows the 
published X-ray structure of OdH-g [4]. Figure 8 shows the 
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Fig 9 Radial phytogenetk tree of molluscan haemocyanin FUs. This 
unrooted tree is based oo a CLUSTM. multiple alignment of the currently 
available sequence (for sources, see [3.1 4.29.30,40]). For abbrevtauons see 
Fig 8 Booistiap percentages arc based on 1000 replicates. Only bootstrap 
values > 50 are shown. Note that with the exception of RtH2-a. topo- 
togically corresponding FU rypes group as a common branch. Assuming 
that me branching node of gastropod and cephalopod haemocyanin hes 520 
million years ago, thi common origin of the eight FU types .date* back 
~ 650 million years, and the branching node of HtHl and HtH21.es - 300 
million years ago. 



structural similarities and diversities calculated from a multiple 
alignment which includes the 19 complete sequences of 
molluscan hernocyanin FUs currently available, and Fig. V 
presents the unrooted, radial representation of a phylogenetic 
tree constructed from the same alignment. 

DISCUSSION 

The HtH 2 subunit Is segmented into eight different 
functional units 

Our data show the existence of eight different FUs in HtH2, and 
reveal their sequential arrangement within the elongated 
subunit (Fig. 3A). Implications of such analyses are discussed 
in detail elsewhere [9.12.14], but it is clear that understanding 
the FU organization is a prerequisite for all future structural 
studies on a particular hernocyanin. With respect to HtH2, it 
should be pointed out that in this case, for the first time, such an 
analysis was accompanied directly by cDNA sequencing which 
greatly helped to decipher the FU arrangement. Although the 
HtH2 pattern differs, in electrophoretic details, from all other 
molluscan hernocyanin FU panels analysed so far (whicfc i are 
different internally and also from each other, e.g. [9,12,14,26,27]). 
the presence of eight different FUs has clearly emerged as 
being broadly applicable to gastropod hemocyanins. This has 
now also been established for the subunits of KLH2 and RtH2, 
which have previously been proposed to contain, respectively, 
seven [9] and five [28] different FUs; more recently both have 
been shown to possess eight FUs (W. Gebauer et al. 
unpublished data). Moreover, it is evident that FU-h is an 
additional FU type found only in gastropods, bearing in mind 
that Sepia FU-'h' is known to correspond to Octopus FU-g [1]; 
this FU is therefore termed here SoH[h]-g. as proposed recently 
[14]- 

Functional units HtH2-d and HtH2-e 

FU-d from the pulmonate gastropod Helix pomatia was the first 
molluscan hernocyanin polypeptide to be sequenced [29] and it 
has been repeatedly compared in detail to the more recently 
obtained FU primary structures, notably from Octopus haemo- 
cyanin [3]. However, this early sequence may still contain 
errors, although some corrections have recently been published 
[30]. Therefore, in the present alignment, we concentrate on 
comparing the primary structures of Haliotis and Octopus 
hernocyanin (Fig. 4), both of which have been deduced entirely 
from their cDNA sequences. The primary structure of HtH2-e 
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derived from cDNA clone hth2-defgh is the first complete 
sequence of a gastropod bernocyajiin functional unit of type e. 
The only other available sequence of this FU type comes from 
Octopus hemocyanin (Fig. 4). 

Alignmeni of the FU-d and FU-e protein sequences from 
HtH2 and OdH shows a structural similarity of 52% for 
HtH2-d/OdH-d and 56% for HtH2-e/OdH-e, whereas compari- 
son of FU-d to FU-e gives a significantly lower score of 42% 
and 44%, respectively (Fig. 8). This type-specific structural 
similarity is also evident from the four deletions/insertions by 
which FU-d differs from FU-e (Fig. 4). An alignment of all 
available molluscan hemocyanin sequences revealed that the 
gaps in FU-d, which in; HtH2 are defined by ^HSP 43 and 
,63 GGHVFDN 169 , are caused by specific deletions in FU-d, 
whereas 144 EG 145 is an insertion in FU-e (shared by OdH-b 
[3]). The fourth variable motif, defined in HtH2-d by 
332 SGEQNCEN339, seems to be rather susceptible to inser- 
tion/deletion events as deduced from the seven Octopus FU 
sequences [3], but also in this case, conservation of sequences 
specific for FU-d on the one hand and FU-e on the other is 
impressive (Fig. 4). It should be noted that all of these 
insertions/deletions occur at positions which correspond, in 
the X-ray structure of OdH-g [4], to irregularily structured 
regions between a-helices or ^-sheets (indicated in Fig. 4 and 
shown in Fig. 7). 
Except for occasional amino acids, HtH2-d and OdH-d differ 

E? y in i2§ fCW Sh ° rt motifs ' for exam P le ,21 QQT 123 versus 
TLL (Fig. 4); however, they are also located between 
secondary structure elements [4]. Interestingly, the mouf 
357 FDR359 wh5q h Miller et al [3] found in five of the seven 
Octopus hemocyanin FUs but not in OdH-d, is present in 
HtH2-d (Fig. 4). The only somewhat variable sequence of FU-d 
within a^condary structure element is 382 DLSI 385 versus 
RVVT (strand 011), but even in this case, one hydro- 
phobic residue has been conserved. Thus, FU-d from Octopus 
and Haliotis are very similar structurally (with the exception of 
possible carbohydrates: see below), and there is little doubt that 
they represent corresponding structural and functional elements 
within their respective native hemocyanin oligomers. 

There are only a few regions in which HtH2-e differs 
considerably from OdH-e, notably the sequence motifs 
l,9 IWSNRDF ,2 \ l63 GGHVFDN J69 and ^RADST^ 6 . How- 
ever, as deduced from the X-ray structure of OdH-g (Figs 4 
and 7), all three motifs are located in loops connecting 
secondary structure elements. A fourth region of considerable 
variability is the tail 4I4 FQRGHR 420 which, however, is part of 
the presumably exposed linker region between HtH2-e and 
HtH2-f. On the other band, there are several motifs which are 
identical in both FU-e versions and different in all other 
sequenced FUs. for example ^SFGAP 269 . And ultimately 
HtH2-e and OdH-e have exactly the same number of amino 
acids (420). All of these structural features, together with the 
overall similarity of the two sequences and the stricdy con- 
served insertions/deletions, clearly suggest that in the archi- 
tecture of their respective bemocyanin, HtH2-e and OdH-e play 
a very similar structural and functional role. 

An obvious difference between HtH2-de and OdH-de con- 
cerns potential asparagine-linked carbohydrate attachment 
sites. FU-d and FU-e from both species possess such a site in 
exactly the same position close to the C-terminus (Fig 4) 
where in OdH-g the loop between strands 011 and 812 is 
localized (Fig. 7). However, in OdH-d but not in HtH2-d an 
additional site for N-giycosides exists, at a position which 
corresponds, in the X-ray structure of OdH-g, to the Iood 
between between ct'l 1 and otl2 (Figs 4 and 7). The other way 
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round, HtH2-e possesses a second site for N-giycosides (between 
strands P4 and £5) which is absent in OdH-e (Fig. 4). In this 
context it should be noted that the amount of carbohydrate side 
chains per FU varies considerably among the molluscan 
hemocyanin FUs sequenced so far, with up to three potential 
sites and an unexpected variety of monoantennary and dian- 
tennary side chains detected in Helix pomatia (see [3,31]). 

Functional units HtH2-f, HtH2-g, and HtH2-h 

The existence of two very different hemocyanin isoforms in 
some marine gastropods was first indicated in 1981 [32], and 
later, many structural and functional differences have been 
revealed [6,9-14,33,34], but this is the first opportunity to 
compare them at the level of their primary structures. The 
structural similarity of FU-fgh from both HtHl and HtH2 is 
striking (Figs 5 and 6); it is 64% for FU-f, 74% for FU-g, and 
60% for FU-b (Fig. 8). Only the linker regions between 
neighbouring FUs and several loops connecting a-helices or 
^-sheets show considerable variability, for example between 
a'6 and a7 (Fig. 5) or between alO and a'll (Fig. 6). 

Compared to the corresponding sequences in Octopus 
hemocyanin, structural conservation is 49-51% for FU-f and 
57-59% for FU-g (Fig. 8). Such a degree of invariance between 
gastropod and cephalopod hemocyanin could probably be 
expected for the six 'wall forming FUs* (i.e. FU-a to FU-f), 
which according to current models of the quaternary structure 
are involved in the cylinder wall of the decamer [J J. However, 
for FU-g as a 'collar/arc forming FU\ it is rather unexpected 
that its invariance is even higher, because the central collar of 
cephalopod hemocyanins differs considerably from the peri- 
pheral collar/arc complex of gastropod hemocyanins, an 
observation which has to date prevented construction of a 
common model of the quaternary structure [1,5,35,36,37]. In 
Octopus hemocyanin decamers, 10 FU-g copies form the 
central collar [36,37]. This structure seems to correspond* in 
gastropod hemocyanin decamers, to the arc which lies under- 
neath the peripheral collar [5,35]. On the other hand, it is 
known that in gastropods, the collar/arc complex is formed by 
10 copies of FU-g and FU-h, respectively [1]. From the 
phylogenetically rather conserved primary structure of FU-g 
(Fig. 5) it can now be predicted that this FU type fulfils a very 
similar structural role in cephalopods and gastropods, which 
would mean that in gastropod hemocyanin decamers it forms 
the entire arc, leaving FU-h as the ODJy element for the 
peripheral collar. This is useful information for the major 
unsolved problem of the molluscan hemocyanin quaternary 
structure: the exact path and orientation of the multi-FU subunit 
within the decamer [5,36]. 

As with HtHl-h [14], HtH2-h also shows the peculiar 
C-terminal tail of 95 amino acids (Fig. 6), which is absent 
from other FUs. Characteristics of this polypeptide elongation 
have been discussed previously for HtHl-h [14]. In the present 
context it should only be pointed out that even its length is 
extremely conserved in both HtH isoforms. Provided that this 
elongation is some kind of anchor to other structural elements, 
its invariance in both hemocyanins suggests that their peri- 
pheral collars are much more similar than proposed previously 
for the equivalent proteins in Megathura, KLHI and KLH2 
[9,12]. 

Potential N-glycosylatlon sites and possible consequences 
for assembly 

In Megathura, KLH2 forms multidecamers; this is not observed 
with KLHI, at least not in vivo (e.g. [6]). Also in Haliotis, a 
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portion of native HtH2 is assembled beyond the didecamer 
(Fig. IB), whereas native HtHl is fully restricted to the 
didecameric state [14]. An interesting aspect in comparing 
the sequences of HtHl-fgh and HtH2-fgh is therefore to find 
specific differences which could explain this behaviour, and the 
most obvious difference is their potential asparagine-1 inked 
N-glycosylation sites (sequence motifs NXT and NXS, see 
Figs 4-6). HtHl-f and HtH2-f both show such a site close to 
the C-terminus. between strands 311 and (J12, but HtHl-f has 
an additional N-glycosylation site between helices al and ct'3, 
which is close to the N-terminus. In HtHl-g there is a site for 
N-glycosides in the region of the first disulfide bridge as in 
OdH-g. and another one between strands f*4 and 05, whereas 
HtH2-g has a single site between strands 01 1 and 012. HtH2-h 
shows a single N-glycosylation site between sheet £12 and 
helix a' 17. In contrast in HtHl-h, two rather unusual N-linked 
carbohydrate sites exist. The first is localized, according to the 
X-ray structure, between helices a/6 and a7, a region of the 
core domain where in Octopus only FU-c has a site for 
N-glycosides. The second is totally unique (as far as sequences 
are available yet) and localized far outside in the so-called 
'P-sandwhich domain' [4], in a region defined by the loop 
between stands 08 and 09 (Figs 6 and 7). 

A gastropod hemocyanin decamer wall consists of three 
fenestrated tiers, with the peripheral collar anchored to the 
upper tier and the internal arc attached to the central tier. Tn a 
didecarner, two decamers are assembled 'face-to-face* at their 
lower tiers. In a tridecamer, an additional decamer is attached, 
with its lower tier to the upper tier of a didecamer. FU-g 
appears to be restricted to the arc (see above) and then would be 
buried too deeply inside the cylinder to interact with a 
neighbouring decamer. This is different for FU-h, 10 copies 
of which seem to form the collar, and FU-f which is thought to 
be involved in the upper tier of the cylinder wall [1,8,35,36]. 
Thus both FU types could well interact with the lower tier of a 
neighbouring decamer, and the additional carbohydrate side 
chains anchored to HtHl-f and/or HtHl-h could probably be 
involved in blocking multidecamer formation in HtHl, whereas 
absence of these side chains in HtH2-f and/or HtH2-h would 
allow such an oligomerization in HtH2. However, this concept 
is highly speculative, and it does not explain why multidecamer 
formation in KLHI from electrophoretically and immuno- 
logically intact subunits could be achieved in vitro [33,34]. 
Future deglycosylation experiments under native conditions as 
well as the sequence analysis of KLHI and KLH2 should help 
to substantiate these ideas. 

Implications for the evolution of molluscan hemocyanin 

Of the protostorne phyla, the molluscs are among the most 
ancient and diverse, and molecular data of their hemocyanin 
might be useful to examine their major divergencies which are 
still much debated (for example [38,39]). In Fig. 9, an unrooted 
radial representation of the phylogenetic tree is presented, 
because so far no outgroup is available; using tyrosinase as the 
outgroup as suggested by Miller et al [3] was not found to 
improve the tree, because the relationship of tyrosinase and 
molluscan hemocyanin is too remote. It has been suggested that 
the molluscan hemocyanin subunit evolved by three successive 
duplications of the proto-gene for the functional unit, each 
followed by gene fusion [1]. However in their phylogenetic 
analyses, Miller et al. [3] found it to be impossible to resolve 
the evolutionary branching orders among the FUs, indicating 
that they evolved very rapidly from their ancestral precursor. 
Indeed, from sequence alignments which include the new HtH2 
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primary structures it was also not possible to trace these early 
events, because the branching pattern of the different FU types 
is highly unstable (Fig. 9). However, reconstruction of the tree 
with different methods only affected nodes that in the present 
tree are not bootstrap-supported, but did not influence the 
conclusions based on the bootstrap-supported branches. In the 
tree, FU-d, FU-e, FU-f. and FU-g from both molluscan classes 
form four stable branches (fig. 9). This confirms that at least 
these four FU types already existed long before gastropods and 
cephalopods separated, and that the members of each branch 
are equivalent in structure and function [1], The only exception 
is RtH2-a [40], because its incorporation in the FU-g branch is 
very stable (Fig. 9). Sequence analysis of FU-a from other 
gastropod haemocyanins is required to approach this specific 
problem. 

In fossil records, the first gastropods occur in the earliest 
Cambrian, and the first cephalopods appear somewhat later, 
about 520 million years ago [41]. This separation event between 
gastropods and cephalopods might be used to calibrate a 
molecular clock (for an appropriate formula, see [42]), although 
the sequence sirnilariries shown in Fig. 8 suggest that this cloclc 
may run at different rales among the different FU types. For 
example, FU-f from HtHl and HtH2 share 64%, FU-g 74% and 
FU-h 60% similarity (Fig. 8), and this may well reflect different 
structural and/or functional constraints influencing the evolu- 
tion rate of the various FU types. Therefore it is probably too 
early for a reliable overall calculation to be obtained. Never- 
theless, from the distances given in Fig. 8 for FU-g as the most 
completely analysed FU type, we estimate that the multiple FTJ 
subunit originated from a proto-gene about 650 million years 
ago (for method, see (42]). This fits recent calculations from 22 
nuclear genes, which suggest that the major branches of tHe 
triploblastic metazoans separated about 830 million years ago, 
long before the Cambrian explosion, 540 million years ago 
[421. Tn this context it should be considered that the elongated 
subunit is the prerequisite for oligomerization, and that in turn 
oligomerization is required for hemocyanin to function as an 
extracellular blood oxygen carrier [1]; therefore, the multiple 
FU subunit most probably evolved when triploblastic metazo- 
ans reached a level of complexity and activity that required 
such a carrier protein. According to the estimation above, it 
took them *= 180 million years to reach this point, and then 
another 100 million years to develop forms which could b>e 
fossilized. The distance values between HtHl-g/HtH2-g and 
HpH0-g suggest that separation of prosobranch and pulmonale 
hemocyanin occured about 400 million years ago. Indeed, the 
first fossil pulmonale and opisthobranch gastropods occur in trie 
Carboniferous, some 360 million years ago [41]. According to 
the distance between HtHl-g and HtH2-g (Fig. 8), separation of 
their ancestral genes happened about 300 million years ago, 
either in the late Carboniferous or the early Permian. Aji 
outburst of phylogenetic change among the archaeogastropods 
in the laier Paleozoic [41] is probably the background of the 
evolution of two distinct hemocyanin forms within the same 
animal. 

It should be appreciated that these are very preliminary 
calculations; the complete hemocyanin subunit sequences from 
several molluscan classes will be required to obtain more 
reliable data. A comparative combination of the rapidly increas- 
ing knowledge of the primary structure of the two hemocyanin 
isoforms of Haliotis with the sequence of Octopus haemo- 
cyanin [3), the X-ray structure of OdH-g [4], and the 15 A 
structure of the KLHI didecamer [51 is a scientifically mean- 
ingful way to approach the remaining questions on the 
quaternary structure, evolution and function of these intriguing 
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copper-containing respiratory proteins. Moreover, determi- 
nation of more molluscan hemocyanin sequences could well 
assist tracing ancient events in molluscan phytogeny. 
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